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■ Abstract 

p/| This paper proves that any monoid presented by a confluent 

^ I context-free monadic rewriting system is word-hyperbolic. This 

result then applied to answer a question asked by Duncan & 
Gilman by exhibiting an example of a word-hyperbolic monoid 
that does not admit a word-hyperbolic structure with uniqueness 
(that is, in which the language of representatives maps bijectively 
onto the monoid). 
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1 Introduction 



> 

Hyperbolic groups — groups whose Cayley graphs are hyper- 
bolic metric spaces — have grown into one of the most fruitful areas of group 
CN ■ theory since the publication of Gromov's seminal paper [Gro87]. The concept 

of hyperbolicity generalizes to semigroups and monoids in more than one 
way. First, one can consider semigroups and monoids whose Cayley graphs 
are hyperbolic [Cai, CS09]. Second, one can use Gilman's characterization of 
hyperbolic groups using context-free languages [GH02]. This characterization 
says that a group G is hyperbolic if and only if there is a regular language L 
(over some generating set) of normal forms for G such that the language 

M(L) = {u# 1 v# 2 w rev :u,v,w E LAuv = G w} 

(where w rev denotes the reverse of w) is context-free. (The pair (L, M(L)) is 
called a word-hyperbolic structure.) Duncan & Gilman [DG04] pointed out that 
this characterization generalizes naturally to semigroups and monoids. The 
geometric generalization gives rise to the notion of hyperbolic semigroup; the 
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linguistic one to the notion of word-hyperbolic semigroups. While the two no- 
tions are equivalent for groups [DG04, Corollary 4.3] and more generally for 
completely simple semigroups [FK04, Theorem 4.1], they are not equivalent 
for general semigroups. This paper is concerned with word-hyperbolic semi- 
groups. 

Some of the pleasant properties of hyperbolic groups do not generalize 
to word-hyperbolic semigroups. For example, hyperbolic groups are always 
automatic [ECH+92, Theorem 3.4.5]; word-hyperbolic semigroups may not 
even be asynchronously automatic [HKOT02, Example 7.7]. On the other hand, 
word-hyperbolicity for semigroups is independent of the choice of generat- 
ing set [DG04, Theorem 3.4], unlike automaticity for semigroups [CRRT01, 
Example 4.5]. 

Duncan & Gilman [DG04, Question 2] asked whether every word-hyper- 
bolic monoid admits a word-hyperbolic structure where the language of rep- 
resentatives L projects bijectively onto the monoid. By analogy with the case of 
automatic groups [ECH+92, § 2.5] and semigroups [CRRT01, p. 380], such a 
word-hyperbolic structure is called a word-hyperbolic structure with uniqueness. 
The question also applies to semigroups; Duncan & Gilman have a particular 
interest in the situation for monoids because a positive answer in that case 
would imply that the class of word-hyperbolic semigroups is closed under 
adjoining an identity. 

As explained in Subsection 2.1 below, every hyperbolic group admits a 
word-hyperbolic structure with uniqueness. Furthermore, every automatic 
semigroup admits an automatic structure with uniqueness [CRRT01, Corol- 
lary 5.6]. 

The main goal of this paper is to give a negative answer to the question 
of Duncan & Gilman by exhibiting an example of a word-hyperbolic monoid 
that does not admit a word-hyperbolic structure with uniqueness (Example 
4.2). En route, however, a result of independent interest is proven: that any 
monoid presented by a confluent context-free monadic rewriting system is 
word-hyperbolic (Theorem 3.1). 



2 Preliminaries 

This paper assumes familiarity with regular languages and finite 
automata and with context-free grammars and languages; see [HU79, Chs 2-4] 
for background reading and for the notation used here. 
The empty word (over any alphabet) is denoted e. 

2. 1 Word-hyperbolicity 

Definition 2.1. A word-hyperbolic structure for a semigroup S is a pair (L, M(L)), 
where L is a regular language over an alphabet A representing a finite gener- 
ating set for S such that L maps onto S, and where 

M(L) = {u#i v#2W rev : u,v, w € L A uv =s w} 

(where #1 and #2 are new symbols not in A and w iev denotes the reverse of 
the word w) is context-free. The pair (L, M(L) is a word-hyperbolic structure 
with uniqueness if L maps bijectively onto S; that is, if every element of S has a 
unique representative in L. 
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A semigroup is word-hyperbolic if it admits a word-hyperbolic structure. 

A group is hyperbolic in the sense of Gromov [Gro87] if and only if it 
admits a word-hyperbolic structure ([G1I02, Theorem 1] and [DG04, Corol- 
lary 4.3]). Furthermore, every group admits a word-hyperbolic structure with 
uniqueness: if (L, M.(L)) is a word-hyperbolic structure for a group G, then 
the fellow-traveller property is satisfied [DG04, Theorem 4.2] and so L forms 
part of an automatic structure for G [ECH + 92, Theorem 2.3.5]. Therefore 
there exists an automatic structure with uniqueness for G, where the lan- 
guage of representatives L' is a subset of L [ECH + 92, Theorem 2.5.1]. Hence 
(L',M(L) (~l L'#i L'#2(L') rev ) is a word-hyperbolic structure with uniqueness 
for G. 

2.2 Rewriting systems 

This subsection contains facts about string rewriting needed later 
in the paper. For further background information, see [BO93]. 

A string rewriting system, or simply a rewriting system, is a pair (A, 31), where 
A is a finite alphabet and 3? is a set of pairs [t, r), known as rewriting rules, 
drawn from A* x A*. The single reduction relation is defined as follows: 
u- =^ir v (where u, v e A*) if there exists a rewriting rule (■£, r) e 31 and 
words x, y € A* such that u = x£y and v = xry. That is, u =^ v if one can 
obtain v from u by substituting the word r for a subword i of u, where (£, r) 
is a rewriting rule. The reduction relation is the reflexive and transitive 
closure of The process of replacing a subword £ by a word r, where 

(•£, r) € 31, is called reduction, as is the iteration of this process. 

A word w € A* is reducible if it contains a subword I that forms the left- 
hand side of a rewriting rule in 31; it is otherwise called irreducible. 

The string rewriting system (A, 31) is noetherian if there is no infinite se- 
quence Ui,U2,... G A* such that ut =^ Ui + i for all i € N. That is, (A, 31) 
is noetherian if any process of reduction must eventually terminate with an 
irreducible word. The rewriting system (A, 31) is confluent if, for any words 
u, u^u" G A* with u u' and u u", there exists a word v € A* such 
that u' v and u" v. 

The string rewriting system (A, 31) is length-reducing if (£, r) € 31 implies 
that \l\ > |r|. Observe that any length-reducing rewriting system is necessarily 
noetherian. The rewriting system (A, 31) is monadic if it is length-reducing and 
the right-hand side of each rule in 31 lies in A U {e}; it is special if it is length- 
reducing and each right-hand side is the empty word e. Observe that every 
special rewriting system is also monadic. 

A special or monadic rewriting system (A, 31) is context-free if, for each 
a G A U {e}, the set of all left-hand sides of rules in 31 with right-hand side a is 
a context-free language. 

Let (A, 31) be a confluent noetherian string rewriting system. Then for any 
word u S A*, there is a unique irreducible word v € A* with u v [BO93, 
Theorem 1.1.12]. The irreducible words are said to be in normal form. The 
monoid presented by (A | 31} may be identified with the set of normal form 
words under the operation of 'concatenation plus reduction to normal form'. 

The subscript symbols in the derivation and one-step derivation relations 
=4^ =>k for a rewriting system 31 are never omitted in this paper, in order 
to avoid any possible confusion with the derivation and one-step derivation 
relations =>p and =>r for a context-free grammar P. 
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3 Monoids presented by confluent context-free monadic 
rewriting systems 

Theorem 3.1. Let (A, !R) be a confluent context-free monadic rewriting system. Then 
(A*, M(A*)) is a word-hyperbolic structure for the monoid presented by (A | 3?). 

Proof. Let M be the monoid presented by (A | ft). Let 

K = {u# 2 v rev : u, v G A*, u = M v}. 

Let (J) : (A U {#! , # 2 })* -^(AU {#1 })* be the homomorphism extending 

#i H> e, #2 H> #2, a H> a for all a G A. 

Then M(A*) = Kc^ 1 n A*#tA*# 2 A*. Since the class of context-free lan- 
guages is closed under taking inverse homomorphisms, to prove that M(A*) 
is context-free it suffices to prove that K is context-free. 
For each a G A U {e}, let $ a and $ Q be new symbols. Let 

$au{e} = {$ Q : a G A U {e}} $ A = {$a : a G A}, 

$au{ £ } = {$a : a G AU{e}} $ A ={$ Q :aGA}, 

and for any word w = wi • • • w n with Wj G A, let $ w and $ w be abbreviations 
for $ W1 • • • $ Wn and $ Wl • • • $ Wn respectively. 

For each a G A U {e}, let Y a = (N a , A, P a , O a ) be a context-free grammar 
such that L(F a ) is the set of left-hand sides of rewriting rules in 3i whose right- 
hand side is a. Since 3i is length-reducing, no L(T a ) contains e. Therefore 
assume without loss of generality that no V a contains a production whose 
right-hand side is £ [HU79, Theorem 4.3]. 

Modify each P Q by replacing each appearance of a terminal letter b G A 
in a production by $t,; the grammar = (N^, $au{e}> Pq> Oa) thus formed 
has the property that w G L(T a ) if and only if $ w G L(P^). Modify each F a 
by reversing the right-hand side of every production in P a and by replacing 
each appearance of a terminal letter b G A in a production by $b ; the grammar 
1^' = (N $aue , Pq, Oa) thus produced has the property that w G L(P Q ) if and 
only if $ w rev g L(0 

The language 

{$p# 2 Vv : p G A*} 

is clearly context-free. (Notice that $ p can either be an abbreviation for a 
non-empty word $ p , • • • $ Pk or the single letter $ £/ and similarly for $ p rev.) 
Let A = (Na,$a U $a{#2}> Pa> Oa) be a context-free grammar defining this 
language. Assume without loss of generality that the various non-terminal 
alphabets N a , N^' and Na are pairwise disjoint. 

Define a new context-free grammar & = (No, A U {#2}, P©, Oa) by letting 

N = N a U$au{ £ }U$au{ £ }U |J (N^UN£), 

QEAUfe} 

and 

P© = Pa U [ U ( p a U K) 

QGAU{e) 

U {$q — > $a$£)$a ~~ ^ $£$a> $a ~~ ^ $a$£> $a ~~ t $ £ $a : CL G AU {e}} (3- 1 ) 

u{$ a 40 a) $ a 40^:a£AU{£}} (3.2) 
u{$ Q ^a,$a^a:aGAU{e}}. (3.3) 
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Notice that elements of $au{e} now play the role of non-terminals, while in 
the various grammars and V£ , they were terminals. Notice further that the 
start symbol of © is Oa- 

The aim is now to show that L(O) = K. 

Lemma 3.2. Ifw£ L(0), then w = u#2V rev for some u,v G A*, and there exists 
some p G A* such that $ p u and $ p rev =>q v rev . 

Proof. Let w G L(O). Then Oa =^e w ' anc ^ * ne ^ rs ^ production applied is from 
Pa- Since no production in Pe — Pa introduces a non-terminal symbol from 
Na, assume that all productions from Pa in the derivation of w are carried 
out first, before any productions from Pe — Pa- This shows that there is some 
word q G L(A) such that Oa => q =^ w. By the definition of A, it follows 
that q = $p# 2 $ p rev with 

Oa =^e> $p#2$p rev =^0 w - 

Since symbols from $au{e} u $au{e} can ultimately only derive symbols from 
A (and not the symbol #2), it follows that there exist u,v G A* with $ p =^g, u 
and $ p rev =^ v rev such that w = u# 2 v rev . □ 

Lemma 3.3. Let w,u G A*. Ifw u, then $ u =>q $ w . 

Proof. Suppose 

W = W =^IR W] w 2 . . . w n = u 

is a sequence of rewriting of minimal length from w to u. 

Proceed by induction on n. If n = 0, it follows that w = u and there is 
nothing to prove. So suppose n > and that the result holds for all shorter 
such minimal-length rewriting sequences. Then wo =>swi, and so wq = xly 
and wi = xay for some x,y G A*, a G A U {e}, and (£, a) G ft. So I G L(P a ). 
Hence, first applying a production of type (3.2), the construction of V' a and the 
inclusion of all its productions in shows that 

$a=>@0' a =>* @ $ t . (3-4) 

By the induction hypotheses, $ u =>q $ Wi . Now consider the cases a G A and 
a = e separately: 

1. a G A. Then $ Wl = $ x $ a $y and so 

$u =^0 $w, (by the induction hypothesis) 

— $x$a$tj 

$M y (by (3.4)) 

— $wo 

— $w • 

2. a = e. Then $ Wl = $ x $ y and so by (3.4), 

$u =^0 $wi (by the induction hypothesis) 

=^0 $x$y 

^0 $x$a$y (by (3.1)) 

^0 $x$s$ y (by (3.4)) 

— $wo 
= $w 
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This completes the proof. 



□ 



Lemma 3.4. Let u, w € A*. If $ u =>q $ W/ fen w u - 

Proof. The strategy is to proceed by induction on the number n of productions 
of type (3.1) or (3.2) in the minimal-length derivation of $ w from $ u . 

Suppose such a minimal length derivation involves a production $ £ — > e 
(of type (3.3)). If this symbol $ £ is introduced by a production of type (3.1), 
then the derivation would not be of minimal length. So this symbol $ £ must 
be present in $ U/ which, by the definition of the abbreviation $ u requires 
u = e. But this would mean that the derivation produced e, which contradicts 
the hypothesis of the lemma. So the derivation does not involve productions 
$ £ -> e. 

The only productions where symbols from $au{e} appear on the left-hand 
side are of types (3.1), (3.2), and (3.3). Since there are no productions $ £ — > 
z, any production of type (3.3) would produce a terminal symbol, which is 
impossible. So the first production applied in the derivation sequence must 
be of type (3.1) or (3.2). 

Suppose first that n = 0. Then there is no possible first production and 
thus $ w = $ u , which entails w = u and so there is nothing to prove. 

Suppose now that n > and that the result holds for all shorter such 
minimal-length derivations. Consider cases separately depending on the whether 
the first production applied in the derivation is of type (3.1) or (3.2).: 

1. Type (3.1). So $ u = $ x $ y =>e $ x $e$y for some x,y e A* with xy = u. 
The symbol $ £ thus produced does not derive e since no production 
$ £ — > e is involved. So $ x =>0 $ w /, $ £ $ w «, $ y $ W '», where 
w = w'w"w"', where w', w'" G A* and w" £ A + and all three of these 
derivations involve fewer than n productions of type (3.1) or (3.2). By 
the induction hypothesis, w' x, w" e, and w'" y, and thus 
w = w'w"w'" =^ xy = u. 

2. Type (3.2). So $ u = $ x $ Q $ y =>e $ x O^$ y for some x,y £ A* with xay = 
u. Now, 0' a is the start symbol of Y' a , and L(P^) consists of words of the 
form $£ where i a. Thus 

$U =^0 SxO^Sy =>@ $ x $£$y =>Q $ w . 

Thus $ x $ w /, $ t $ w », and $ y ^* @ $ w m, where w>",w'" G A* 
are such that w = w'w"w'", and each of these derivation sequences 
involve fewer than n productions of type (3.1) or (3.2). Hence by the 
induction hypothesis, w' x, w" I, and W" =^ y. Therefore 

w = w'w"w"' xly =^ji xay = u. 

This completes the proof. □ 

Lemma 3.5. For any u, w € A* w =^1% u if and only if$ u w. 

Proof. Suppose w =^ u - Then $ u $ w by Lemma 3.3. By |w| applications 
of productions of type (3.3), $ w w. So $ u =4>q w. 

Suppose that $ u =>q w. Only productions of type (3.3) have terminals on 
the right-hand side. So $ u $ w =>q w. So by Lemma 3.4, w u. □ 
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Reasoning symmetric to the proofs of Lemmata 3.3, 3.4, and 3.5 establishes 
the following result: 

Lemma 3.6. For any u, w € A* w u if and only if $ u rev =^ w rev . 

Suppose u#2V rev G K. Then u,v G A* and u =m v. Therefore there is a 
normal form word p with u p and v =^ p. So by Lemmata 3.5 and 3.6, 
$p =>q u and $ p rev v rev . Since every production in Pa is included in Pe, it 
follows that 

Oa =>e $p#2$p rev > 

whence A =^q u# 2 v rev and so u# 2 v rev G L(B). 

Conversely suppose w G L(0). By Lemma 3.2, there are words u,v,p G A* 
with w = u#2V rev , $ p =>q u, and $ p rev =^ v rev . By Lemmata 3.5 and 3.6, it 
follows that u =^ p and v =^ p. So u =m v and thus w = u#2V rev G K. 

Hence L(O) = K. Thus K and so M(A*) are context-free. Therefore 
(A*, M(A*)) is a word-hyperbolic structure for the monoid M. □ 



4 Word-hyperbolic structures with uniqueness 

This section exhibits an example of a word-hyperbolic monoid 
that does not admit a word-hyperbolic structure with uniqueness. 

The following preliminary result, showing that admitting a word-hyper- 
bolic structure with uniqueness is not dependent on the choice of generating 
set, is needed. The proof is similar to that of the independence of word- 
hyperbolicity from the choice of generating set [DG04, Theorem 3.4], but the 
detail and exposition are different to make clear that uniqueness is preserved. 
Additionally, the result here also shows that whether one deals with monoid 
or semigroup generating sets is not a concern. 

Proposition 4.1. Let M be a monoid that admits a word-hyperbolic structure with 
uniqueness over either a semigroup or monoid generating set, and let A be a finite 
alphabet representing a semigroup or monoid generating set for M. Then there is a 
language L such that (A, L) is a word-hyperbolic structure with uniqueness for M. 

Proof. Suppose S admits a word-hyperbolic structure (B,K). For each b G B, 
let Ub G A* be such that Ub =m b. (If A represents a semigroup generating 
set, ensure that Ub lies in A + ; this restriction is important only if b is actually 
the identity.) Let 7 C B* x A* be the rational relation: 

?=({(b,u b ):bGB})* 

Notice that if (v, w) G then v =m w. 
Let 

L = K o y = {w G A* : (3v G K)((v, w) G O 5 )}; 

observe that L is a regular language. Notice that, by the definition of CP, for 
each word v in K there is exactly one word w G L with (v,w) G IP. Since for 
each x G M there is exactly one word v in K with v =jw x, it follows that 
there is exactly one word w G L with w =m x. That is, the language L maps 
bijectively onto M. 

Let Q be the rational relation 

?(#!,#! m#2,#2):P reV . 
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Then M(L) = M(K) o Q and so M(L) is a context-free language. 

Thus (A, L) is a word-hyperbolic structure for S in every case except when 
S is a monoid, A is a semigroup generating set, and the representative in K of 
the identity is e. In this case, let Li = (L— {e})U{e}, where e G A + representings 
the identity. Then Li is contained in A + and maps bijectively onto S. The 
language M(Li ) is context-free: a pushdown automaton recognizing it can be 
constructed from one recognizing M(L) by modifying it to read e instead of 
the empty word as one of the multiplicands or result while simulating reading 
the empty word whenever e is encountered. □ 

Example 4.2. Let A = {a,b,c, d} and let 31 = {(ab a c a d, e) : a € N}. Let M 
be the monoid presented by (A | 3?). Then M is word-hyperbolic but does not 
admit a regular language of unique representatives and thus, in particular, 
does not admit a word-hyperbolic structure with uniqueness. 

Proof. Let G be the language of left-hand sides of rewriting rules in 31. The 
language G is context-free, and so (A, 31) is a context-free special rewriting 
system. Two left-hand sides of rewriting rules in 31 only overlap if they are 
exactly equal, and so (A, 31) is confluent. Hence, by Theorem 3.1, (A*, M(A*)) 
is a word-hyperbolic structure for the monoid M. So M is word-hyperbolic. 
Identify M with the language of normal form words of (A, 31). 

Suppose for reductio ad absurdum that M admits a word-hyperbolic struc- 
ture with uniqueness. Then, by Proposition 4.1, there is a regular language L 
over A such that (L, M(L)) is a word-hyperbolic structure with uniqueness for 
M. In particular, every element of M has a unique representative in L. Let 21 
be a finite state automaton recognizing L and let n be the number of states in 



Now, if w € L represents u G M, then w =4>^ u: the word u can be obtained 
from w by replacing subwords lying in G by the empty word, which effectively 
means deleting subwords that lie in G. Consider this process in reverse: the 
word w can be obtained from u by inserting words from G. 

If a word from G is inserted between two letters of u, call it a depth-] in- 
serted word. If a word from G is inserted between two letters of a depth-k 
inserted word, it is called a depth- (k + 1 ) inserted word. A word inserted im- 
mediately before the first letter or immediately after the last letter of a depth-k 
inserted word also counts as a depth-k inserted word. See the following ex- 
ample, where for clarity symbols from u are denoted by x: 



Then it is possible to obtain w from u by performing all depth-1 insertions 
first, then all depth-2 insertions, and so on until w is reached. 

Suppose that, in order to obtain w from u, a word ab a c a d G G is inserted 
for some a > n. Let w = w'aw"dw'", where these distinguished letters a and 
d are the first and last letters of this inserted word. Notice that w" =^ b a c a , 
since 



(Of course, w" may or may not contain inserted words of greater depth.) 
Since a exceeds n, the automaton 21 enters the same state immediately af- 
ter reading two different symbols b of this inserted word, say after read- 



depth 2 



depth 1 



depth 1 




x ab abbccdbccd abbccdxx abbbcccdxx. 

V v ' 

depth 1 



w = w'aw"dw" w'ab a c a dw /// =>t? w'w'" u. 
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ing w'apb and w'apbqb. Similarly it enters the same state immediately af- 
ter reading two different symbols c of this inserted word, say after reading 
w'apbqbrc and w'apbqbrcsc. Therefore by the pumping lemma, w factors 
as w'apbqbrcsctdw'" such that 

w / apb(qb) i rc(sc) j tdw /// e L 

for all i, j 6 NU {0}, where the subwords p and q consist of letters b (members 
of this inserted word) and possibly also inserted words of greater depth, the 
subwords s and t consist of letters c (members of this inserted word) and 
possibly also inserted words of greater depth, and the subword r consists of 
some letters b followed by some letters c (members of this inserted word) and 
possibly also inserted words of greater depth. Thus 

where |3i + p2 + $3 + 2 = yi + 72 + 73 + 2 = a. It follows that 

w / apb(qb) i rc(sc) j tdw /// 
w / ab |3 'b(b |32 b) i b |33 c Y3 c(c Y2 c) j c Yl dw'" 
= w / ab a+tp2+1)(i-1) c a+(T2+1)(j-1) dw"'. 

Set i = 72 + 2 and j = P2 + 2 to see that 

w'apb(qbP +1 rc(sc) p2+1 tdw"' e L 

and 

w / apb(qb) Y2+1 rc(sc) |32+1 tdw /// 

=^ w ' Q b«+( P2 + I )(y2+1) c a+(T2+1)(P2+1)^ w "/ 

^w'w" (since a b a+( ^ +,) ^ +1) c a+(l ' 2+,)( ^ ,) dGG) 
^ u. 

So there are two distinct words w and w'apb(qb) Y2+1 rc(sc) 132+1 tdw" in L 
representing the same element u of M. This is a contradiction and so shows 
the falsity of the supposition that the insertion of a word ab a c a d with a > n 
is used in obtaining the representative in L from a normal form word in M. 

Let G' = {ab a c a d : a ^ n}. Then obtaining a word w G L representing 
u € M requires inserting only words from G'cG. 

Now suppose that an insertion of depth greater than n 2 is required to 
obtain w from u. Then w factorizes as w'apaqdrdw", where the first dis- 
tinguished letter a and second distinguished letter d are the first and last 
letters of some inserted word of depth k, and the second distinguished letter 
a and first distinguished letter d are from some inserted word of depth I > k, 
and where the automaton 21 enters the same state after reading the two distin- 
guished letters a and enters the same state after reading the two distinguished 
letters d. (Such a factorization must exist because there are only n 2 possible 
pairs of states, and there are inserted words of depth exceeding n 2 .) Notice 
that aqd e and so apaqdrd aprd e. Then, by the pumping 
lemma, 

w'apapaqdrdrdw" e L, 

but 

w'apapaqdrdrdw" =^ w'apaprdrdw" w'aprdw" w'w" =^>y, u, 
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and so there are two representatives w and w'apapaqdrdrdw" in L of u e 
M. This is a contradiction and so shows the falsity of the assumption that 
insertions of depth greater than n 2 are required to obtain the representative 
in L of a normal form word in M. 

Suppose that, in the process of performing insertions to obtain a represen- 
tative w € L for an element u G M, a word w' k ' is obtained after the insertions 
of depth k have been performed. Suppose further that in performing the in- 
sertions of depth k + 1 , more than n insertions are made between consecutive 
letters of to obtain a word w' l<+1 (The reasoning below also applies if 
w^ is the empty word, which would require k = 0.) Then w' l<+1 ' factors as 

w (k+1 ) = v / abai c a, dab «2 c a2 d . . . ab a h C ah dv", 

where h > n, and each ab at c ai d is a word from G'. Then w factors as 

w = w'api dap2d • • • apndw", 

where w' v', w" =4>^ v", and pt b ai c ai for each i. Then 21 enters 
the same state on reading w'api dap2d • • • apid and w'api dap2d • • • apj d for 
some i < j. So by the pumping lemma, 

q = w'api dap2d- • • apid(api + i d - • • apjd) 2 apj + i d - • • ap^dw" e L. 

But 

q = w'api dap2d- • • aptd(apt + i d - • • apjd) 2 apj + i d - • • ap^dw" 
^^v'ab a 'c ai dab a2 c a2 d--- 

•••ab ai c ai d(ab ai +'c ai+, d--- 

• • • ab a ic a i d) 2 ab a i+i c a j+1 d . . . ab «h c 'XH dv /' ) 

^ v'v" 

and so there are two representatives w and q of the element u e M. This 
contradicts the uniqueness of representatives in L and shows the falsity of 
the supposition that more that n insertions between consecutive letters in the 
process of obtaining a representative in L for an element of M. 

Therefore, to sum up: a representative w in L of an element u of M can be 
obtained by inserting elements of G' to a depth of at most n 2 , with at most n 
consecutive words being inserted between adjacent letters at any stage. Notice 
that the maximum length of words in G ' is 2n + 2. Thus, starting with empty 
word, the after depth 1 insertions, there are at most n(2n + 2) letters; after 
depth 2 insertions, at most n 2 (2n + 2) 2 ; and after depth n 2 insertions, at 

2 2 

most h = n n (2n + 2) n . Similarly, if one starts with a word u and performs 
insertions to obtain its representative in L, at most H new symbols are inserted 
between any adjacent pair of letters in u. 
Define 

H = {w G A* : |w| ^ h,w e}. 

Then, by the observations in the last paragraph, if u G M with u = ui • • • u n 
is represented by w G L, then w G Hui Hu2 • • • Hu n H. Define the rational 
relation 

T=({(a,a):aeA)U{(p )£ ):peH})'. 
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Then, since removing all subwords in H from a word in L yields the word to 
which it rewrites, it follows that 

M = (LoT) n (A* — A*HA*) = {u G A* — A*HA* : (3w € L)((w,u) € 5 )}, 

and so M, which is the language of normal forms of (A, 01), is regular. 

However, two words ab a c' 3 d and ab a c' 3 d (where a, (3, a', (3' € N) rep- 
resent the same element of M if and only if a = (3 and a' = (3', in which 
case they both represent the identity of M. Thus, since in M the unique rep- 
resentative of the identity is e, the language K = ab*c*d — M, which is also 
regular, consists of precisely those words of the form ab oc c' 3 d that represent 
the identity. That is, the language K is {ab a c a d : a G N}, which is not regular 
by the pumping lemma. This is a contradiction, and so M does not admit a 
regular language of unique representatives. □ 
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